Identification of prosodic attitudes by a temporal recurrent network.

Authors

  • Jean-Marc Blanc
  • Peter Ford Dominey
Abstract

Human speakers modulate the fundamental frequency (F0) of their utterances in order to express different 'prosodic' attitudes such as surprise or curiosity. How are these prosodic attitudes then decoded? The current research addresses the issue of how the temporal structure of F0 can be used in order to discriminate between prosodic attitudes in natural language using a temporal recurrent neural network (TRN) that was initially developed to simulate the neurophysiology of the primate frontostriatal system. In the TRN, a recurrent network of leaky integrator neurons encodes a continuous trajectory of internal states that characterizes the input sequence. The input to the model is a population coding of the continuous, time-varying values of the fundamental frequency (F0) of natural language sentences. We expose the model to an experiment based on one in which human subjects were required to discriminate between different prosodic attitudes (surprise, exclamation, question, etc.). After training, the model discriminates between six prosodic attitudes in new sentences at 82.52% correct, compared to 72.8% correct for human subjects. These results reveal (1) that F0 provides relevant information for prosodic attitude discrimination, and (2) that the TRN demonstrates a categorical sensitivity to this information that can be used for classifying new sentences.
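The two mechanisms named in the abstract, a population coding of the F0 contour and a recurrent network of leaky integrator neurons, can be illustrated with a minimal sketch. It is not the authors' TRN. The number of units, the F0 range, the Gaussian tuning curves, the time constant, the random weights, and the function names `population_code` and `run_leaky_recurrent` are all assumptions made for illustration. The sketch also omits the trained readout that performs the classification.

```python
import numpy as np


def population_code(f0_hz, n_units=20, f_min=75.0, f_max=400.0):
    """Encode an F0 contour (Hz, one value per frame) as unit activities.

    Assumption: each unit has a Gaussian tuning curve centred on a
    preferred frequency; centres are evenly spaced over [f_min, f_max].
    """
    f0_hz = np.asarray(f0_hz, dtype=float)
    centers = np.linspace(f_min, f_max, n_units)
    width = (f_max - f_min) / (n_units - 1)  # one centre spacing
    diff = (f0_hz[:, None] - centers[None, :]) / width
    return np.exp(-0.5 * diff ** 2)  # shape: (frames, n_units)


def run_leaky_recurrent(inputs, n_hidden=50, tau=5.0, seed=0):
    """Drive a fixed random recurrent layer of leaky integrators.

    Assumption: Euler step of  tau * dx/dt = -x + W_in u + W_rec tanh(x)
    with a step of one frame. Weights are random and never modified here.
    Returns the trajectory of firing rates, shape (frames, n_hidden).
    """
    rng = np.random.default_rng(seed)
    n_in = inputs.shape[1]
    w_in = rng.normal(0.0, 0.5, size=(n_hidden, n_in))
    w_rec = rng.normal(0.0, 1.0 / np.sqrt(n_hidden), size=(n_hidden, n_hidden))

    x = np.zeros(n_hidden)  # membrane-like internal state
    states = np.empty((inputs.shape[0], n_hidden))
    for t, u in enumerate(inputs):
        drive = w_in @ u + w_rec @ np.tanh(x)
        x = x + (drive - x) / tau  # leaky integration
        states[t] = np.tanh(x)     # bounded firing rate
    return states


# Hypothetical rising and falling contours (Hz), 40 frames each.
rising = np.linspace(120.0, 250.0, 40)
falling = rising[::-1]

states_rising = run_leaky_recurrent(population_code(rising))
states_falling = run_leaky_recurrent(population_code(falling))
```

Because the internal state integrates its input over time, the two contours contain the same set of F0 values yet end in different network states. A classifier trained on such states could in principle exploit that order sensitivity to separate contour shapes.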

Similar resources

Temporal Structure Classification of Natural Languages by a Recurrent Reinforcement Network

Human infants are sensitive at birth to the contrasting rhythms or prosodic structures of languages, that can serve to bootstrap acquisition of grammatical structure. We present a novel recurrent network architecture that simulates this sensitivity to different temporal structures. Recurrent connections in the network are non-modifiable, while forward connections from the recurrent network to t...

A New Recurrent Fuzzy Neural Network Controller Design for Speed and Exhaust Temperature of a Gas Turbine Power Plant

In this paper, a recurrent fuzzy-neural network (RFNN) controller with neural network identifier in direct control model is designed to control the speed and exhaust temperature of the gas turbine in a combined cycle power plant. Since the turbine operation in combined cycle unit is considered, speed and exhaust temperature of the gas turbine should be simultaneously controlled by fuel command ...

Temporal Processing for Syntax Acquisition: A simulation study

Early perceptual processing capabilities are likely to contribute to the categorization of lexical vs. grammatical words by newborns. This lexical categorization could be performed by detecting differences in the prosodic structure of these word categories. Here we demonstrate that a Temporal Recurrent Network (TRN) that allows realistic treatment of the dynamic temporal aspect of prosody perfo...

Language identification from prosody without explicit features

Most current language identification (LID) systems make little or no use of prosodic information, despite the importance of prosody in LID by humans. The greatest obstacle has been that of finding an appropriate feature set which captures linguistically relevant prosodic information. The only system to attempt LID entirely on the basis of prosodic variables uses a set of over 200 features which ar...

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

Journal:
  • Brain research. Cognitive brain research

Volume 17, Issue 3

Pages: -

Publication date: 2003